Abstract — This work establishes fundamental limits with respect to rate, reliability and computational complexity, for a general setting of outage-limited MIMO communications. In the high-SNR regime, the limits are optimized over all encoders, all decoders, and all complexity-regulating policies. The work then explicitly identifies encoder-decoder designs and policies that meet this optimal tradeoff. In practice, the limits aim to meaningfully quantify different pertinent measures, such as the optimal rate-reliability capabilities per unit complexity and power, the optimal diversity gains per complexity costs, or the optimal number of numerical operations (i.e., flops) per bit. Finally, the tradeoff's simple nature renders it useful for insightful comparison of the rate-reliability-complexity capabilities of different encoders-decoders.

Index Terms — Diversity-multiplexing tradeoff, lattice de- 
signs, rate-reliability-complexity, multiple-input multiple-output 
(MIMO), space-time coders-decoders, fundamental limits, lattice 
reduction, regularization. 

I. Introduction

A. General system model

We consider the general multiple-input multiple-output (MIMO) communications setting, where the $m \times 1$ vector representation of the received signal $y$ is given by

$$y = Hx + w, \qquad (1)$$

where $x$ is the $n \times 1$ vector representation of the coded transmitted signals, $H$ the $m \times n$ channel matrix, and where $w$ represents additive noise. $H$ is considered to be random, having an arbitrary distribution, and being parameterized by $\rho$, which is interpreted as the SNR (cf. [1]). $w$ is taken to be i.i.d. Gaussian with fixed variance. We assume that one use of (1) corresponds to $T$ uses of some underlying "physical" channel.
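As an illustration of the model in (1), the following minimal sketch draws one channel use in plain Python. The Gaussian entries of H, the dimensions, and the helper name `simulate_channel_use` are assumptions made only for this example; the text itself allows an arbitrary distribution for H.

```python
import random


def simulate_channel_use(H, x, noise_std=1.0, rng=None):
    """Return y = Hx + w for an m x n matrix H (list of rows) and a length-n vector x.

    The noise w is i.i.d. Gaussian with standard deviation noise_std.
    """
    rng = rng or random.Random()
    y = []
    for row in H:
        if len(row) != len(x):
            raise ValueError("each row of H must have len(x) entries")
        noiseless = sum(h * xi for h, xi in zip(row, x))
        y.append(noiseless + rng.gauss(0.0, noise_std))
    return y


# Hypothetical 3 x 2 example with Gaussian channel entries (an assumption).
rng = random.Random(0)
m, n = 3, 2
H = [[rng.gauss(0.0, 1.0) for _ in range(n)] for _ in range(m)]
x = [1.0, -1.0]
y = simulate_channel_use(H, x, noise_std=1.0, rng=rng)
```

Setting `noise_std=0.0` returns the noiseless product Hx, which is a convenient sanity check.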

The model applies to several network topologies and sce- 
narios, such as MIMO, MIMO-OFDM, MIMO-MAC, MIMO-ARQ, 
and cooperative communications, where each such scenario 
endows H and x with different structures, dimensionalities 
and statistics. This work specifically considers the non-ergodic, 
outage-limited setting, in which the above MIMO-related 
scenarios play a crucial role in improving the error and rate 
performance, though usually at the expense of much higher 
encoding-decoding computational complexity. 
B. Motivation and general results

Error performance and encoding-decoding complexity in telecommunications (cf. [2]-[6]) are widely considered to be two limiting and interrelated bottlenecks. Joint exposition of these two aspects becomes increasingly necessary in order to meaningfully quantify the ever-increasing complexity costs of reliable communication, in systems that progressively become larger and more dynamic.

A natural question then relates to establishing and meeting 
joint fundamental error-performance and complexity limits, 
optimized over all choices of encoders, decoders and policies. 
Such limits will be here described, under a high SNR approx- 
imation, in the form of an optimal rate-reliability-complexity 
tradeoff for MIMO communications. 

The limits provide answers on the pertinent measure

$$(\text{SNR}, \text{rate}, \text{reliability}, \text{complexity}),$$

and do so within approximation factors which, for increasing $\rho$, vanish to be smaller than any $\rho^{\epsilon}$, for any $\epsilon > 0$. Specifically these answers pertain to the following.

• Description of the best achievable SNR-rate-reliability-complexity combination, optimized over all transceivers and policies (Theorem 1).
• Description of the union of all achievable SNR-rate-reliability-complexity combinations (Corollary 1a).
• Description of the optimal value achieved by a large family of utility measures which quantify the SNR-rate-reliability-complexity capabilities of transceivers, and which are decreasing functions of complexity and of error probability (Corollary 1b).

C. Structure of paper 

Section II recalls the general transceiver setting, and defines the different performance measures. Section III introduces the asymptotic measures of performance, directly applying the diversity-multiplexing tradeoff (DMT, [2]) as the pertinent asymptotic measure of rate-error performance, and defining the worst-case complexity exponent as a high-SNR asymptotic measure of the worst-case complexity of (reasonable) transceivers. Section IV presents the optimal high-SNR rate-reliability-complexity tradeoff, and the optimal transceiver utility value in its general form, as well as in its simpler, more specific, homogeneous variant. Finally Section V concludes.

II. TRANSCEIVER DESIGN AND DECODING POLICY: 
SNR, RATE, RELIABILITY AND COMPLEXITY 

A. Transceiver design and decoding policy 

Consider a sequence of transceiver designs $\mathcal{X}_\rho, \mathcal{D}_\rho$, parameterized by $\rho$, where $\mathcal{X}_\rho \subset \mathbb{R}^n$ denotes the codebook that maps information into transmitted signals, and where $\mathcal{D}_\rho$ denotes the decoder(s) that extract information from the received signals. Let the transmitted codewords $x$ be picked, with uniform probability, from the codebook $\mathcal{X}_\rho$. Transmission has duration $T$, SNR $\rho$, rate

$$R = \frac{1}{T}\log_2 |\mathcal{X}_\rho|,$$

and an enforced power constraint such that

$$\frac{1}{|\mathcal{X}_\rho|}\sum_{x \in \mathcal{X}_\rho} \|x\|^2 = T. \qquad (2)$$

For simplicity we write $\mathcal{X}, \mathcal{D}$, and we let the parameterization be implied.

Consider a policy $\mathcal{P}$ (short for $\mathcal{P}_\rho$), which generally trades off error performance with complexity, by forcing the decoder to limit the number of numerical operations (i.e., flops) up to a maximum designated number of flops. Once this limiting number of flops is reached, the decoder quits and declares an error. This limiting number of flops may or may not be chosen as a function of the instantaneous $H, y$, and will generally depend on $\rho$.
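The role of such a policy can be sketched in code as a flop budget wrapped around an arbitrary decoder. The sketch below is only illustrative: the names `FlopCounter`, `decode_with_policy` and `exhaustive_decoder`, and the rough per-row flop charge, are hypothetical choices for this example and are not taken from the paper.

```python
class FlopBudgetExceeded(Exception):
    """Raised when the decoder exceeds the flop limit set by the policy."""


class FlopCounter:
    def __init__(self, max_flops):
        self.max_flops = max_flops
        self.used = 0

    def charge(self, flops):
        # Account for the flops first, then enforce the limit.
        self.used += flops
        if self.used > self.max_flops:
            raise FlopBudgetExceeded


def decode_with_policy(decoder, y, H, max_flops):
    """Run decoder under a flop limit; return (estimate or None, flops used).

    A None estimate means that the decoder quit and an error is declared.
    """
    counter = FlopCounter(max_flops)
    try:
        return decoder(y, H, counter), counter.used
    except FlopBudgetExceeded:
        return None, counter.used


def exhaustive_decoder(codebook):
    """Full-search minimum-distance decoder over an explicit codebook."""
    def decode(y, H, counter):
        best, best_metric = None, float("inf")
        for x in codebook:
            metric = 0.0
            for row, yi in zip(H, y):
                counter.charge(2 * len(x) + 2)  # rough count, for illustration only
                metric += (yi - sum(h * xi for h, xi in zip(row, x))) ** 2
            if metric < best_metric:
                best, best_metric = x, metric
        return best
    return decode
```

With a codebook of two scalar codewords and a single-row H, each codeword costs 4 counted flops here, so a limit of 8 lets the search finish while a limit of 5 forces a declared error.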

B. Rate, reliability and complexity 

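The error probability $P_{\mathcal{XDP}}$ introduced by the specific $\mathcal{X},\mathcal{D},\mathcal{P}$ is simply

$$P_{\mathcal{XDP}} := P\left(H, x, w : \hat{x}_{\mathcal{XDP}} \neq x\right), \qquad (3)$$

where $\hat{x}_{\mathcal{XDP}}$ denotes the vector decoded by $\mathcal{D}$, under the restrictions of $\mathcal{P}$. For a given $\mathcal{X},\mathcal{D},\mathcal{P}$ and a given realization of problem inputs $H, x, w$, we let $N_{\mathcal{XDP}}(H, x, w)$ denote the overall instantaneous introduced complexity, in flops. Then worst-case complexity is simply given by

$$C_{\mathcal{XDP}} := \sup_{H,x,w} N_{\mathcal{XDP}}(H, x, w). \qquad (4)$$

A pertinent measure of performance for any $\mathcal{X},\mathcal{D},\mathcal{P}$ then becomes the corresponding set of achievable combinations $(\rho, R, P_{\mathcal{XDP}}, C_{\mathcal{XDP}})$, or its equivalent one-to-one remapping

$$\left(\rho,\ R,\ -\frac{\log P_{\mathcal{XDP}}}{\log Z},\ \frac{\log C_{\mathcal{XDP}}}{\log L}\right),$$

where $Z, L$ regulate the refinements of the representation.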

III. Error and complexity exponents 
A. Quantifying error performance: DMT 

As a measure of rate-reliability performance, we adopt the refinement of the diversity-multiplexing tradeoff, identified by Zheng and Tse in [2] as a fundamental performance limit in outage-limited MIMO communications.

In this setting, both the error probability $P_{\mathcal{XDP}}$ introduced by the specific $\mathcal{X},\mathcal{D},\mathcal{P}$, as well as the cardinality of $\mathcal{X}$, are parameterized by $Z = \rho$. Specifically the code cardinality $|\mathcal{X}|$ is described by the multiplexing gain

$$r := \lim_{\rho\to\infty} \frac{R}{\log_2 \rho} = \lim_{\rho\to\infty} \frac{\log |\mathcal{X}|}{T \log \rho},$$

and the associated error performance delivered by the transceiver and policy is described by the limiting behavior of $-\frac{\log P_{\mathcal{XDP}}}{\log \rho}$, i.e., by the diversity gain [2]

$$d_{\mathcal{XDP}}(r) := -\lim_{\rho\to\infty} \frac{\log P_{\mathcal{XDP}}}{\log \rho}.$$
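For intuition, both gains can be estimated numerically at finite SNR. The sketch below is an illustration under stated assumptions: the error curve `toy_error_probability` is hypothetical, and the slope between two SNR points is used as a finite-SNR stand-in for the limit in the definition, since it cancels constant factors.

```python
import math


def multiplexing_gain_estimate(rate_bits, rho):
    """Finite-SNR value of R / log2(rho)."""
    return rate_bits / math.log2(rho)


def exponent_estimate(f, rho_lo, rho_hi):
    """Slope of log f(rho) versus log rho between two SNR values."""
    rise = math.log(f(rho_hi)) - math.log(f(rho_lo))
    run = math.log(rho_hi) - math.log(rho_lo)
    return rise / run


def toy_error_probability(rho, d=2.0, k=5.0):
    """Hypothetical error curve k * rho^(-d), used only as test data."""
    return k * rho ** (-d)


# The diversity gain is the negative of the error-probability exponent.
d_hat = -exponent_estimate(toy_error_probability, 1e3, 1e6)
r_hat = multiplexing_gain_estimate(rate_bits=10.0, rho=2.0 ** 10)
```

For the toy curve the constant k cancels in the slope, so `d_hat` recovers the exponent d = 2 up to floating-point error, and `r_hat` equals 1 since R = log2(rho).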
B. Regulating and quantifying complexity performance: worst-case complexity exponent

We now consider the one-to-one mapping

$$C_{\mathcal{XDP}} \mapsto \frac{\log C_{\mathcal{XDP}}}{\log L},$$

where $L$ is a properly chosen scaling factor, being for example a function of $|\mathcal{X}|$. Seeking patterns and insight, we move to asymptotics, where a general asymptotic worst-case complexity measure then takes the form

$$\lim_{\rho\to\infty} \frac{\log C_{\mathcal{XDP}}}{\log L}.$$

Similar to the DMT in [2], which measures the high-SNR $P_{\mathcal{XDP}}$ as a polynomial power of $\rho$, the currently chosen measure of complexity will also be an exponent over $L = \rho$, taking the form

$$c_{\mathcal{XDP}}(r) := \lim_{\rho\to\infty} \frac{\log C_{\mathcal{XDP}}}{\log \rho}. \qquad (7)$$



We note that the chosen worst-case complexity exponent $c_{\mathcal{XDP}}(r)$ keeps in line with the relevant behavior of most known transceivers, uniformly covering the full complexity range

$$0 \le c_{\mathcal{XDP}}(r) \le rT$$

of all reasonable transceivers, with $c_{\mathcal{XDP}}(r) = 0$ corresponding to the fastest possible transceiver (requiring a small fixed number of flops per codeword), and with $c_{\mathcal{XDP}}(r) = rT$ corresponding to the slowest, full-search uninterrupted ML decoder¹ in the presence of a canonical code with multiplexing gain $r$, i.e., with $|\mathcal{X}| \doteq \rho^{rT}$. In the above, the $\doteq$ notation is used when $f(\rho) \doteq \rho^{B}$ iff (cf. [2])

$$\lim_{\rho\to\infty} \frac{\log f(\rho)}{\log \rho} = B,$$

and the symbols $\dot{\ge}$ and $\dot{\le}$ are defined similarly.

We also note that both $d_{\mathcal{XDP}}(r)$ and $c_{\mathcal{XDP}}(r)$ quantify worst-case (non-ergodic) behavior, and they are both set by the structural properties of the design $\mathcal{X},\mathcal{D},\mathcal{P}$ as well as the statistical properties of $H, x, w$.

We proceed to find the optimal $\mathrm{opt}_{\mathcal{X},\mathcal{D},\mathcal{P}}\left(d_{\mathcal{XDP}}(r), c_{\mathcal{XDP}}(r)\right)$, which is equivalent to finding the optimal $\mathrm{opt}_{\mathcal{X},\mathcal{D},\mathcal{P}}\left(\rho, R, P_{\mathcal{XDP}}, C_{\mathcal{XDP}}\right)$ up to a factor that vanishes to a number smaller than any $\rho^{\epsilon}$, for any $\epsilon > 0$, in the limit of high SNR.

¹We here note that, strictly speaking, $\mathcal{X},\mathcal{D},\mathcal{P}$ may potentially introduce a complexity exponent larger than $rT$. In such a case though, $\mathcal{X},\mathcal{D},\mathcal{P}$ may be substituted by a lookup table implementation of $\mathcal{X}$ and an unrestricted ML decoder. This encoder-decoder will jointly introduce a worst-case complexity that is a constant multiple of $|\mathcal{X}|$. It is noted that the number of flops per visited codeword is independent of $\rho$.






IV. Performance-complexity tradeoff 

We proceed to establish the fundamental limits, optimized over all achievable SNR-rate-reliability-complexity combinations of any transceiver and policy, up to a factor that vanishes in the limit of high $\rho$.

Towards this we describe the decoder and encoder structures that, together with a specific policy, meet a natural upper bound to this tradeoff, for all values of $r$. We start with the decoder, but for now disregard the policy.

1) The candidate decoder - the DMT optimal LLL-based LR-aided regularized linear decoder: We focus on the efficient and DMT optimal, LLL-based lattice-reduction (LR)-aided regularized linear decoder, presented in its general form in [1], [7], [8] for different settings, drawing from works such as [9], [10]. We clarify that the decoder applies to lattice codes, and for completeness recall the decoder's three main steps. In the first step, the decoder performs regularization via MMSE-GDFE-like preprocessing, thus inducing a regularized metric (cf. [1])

$$\hat{x} = \arg\min_{\hat{x} \in \Lambda} \|y - H\hat{x}\|^2 + \|\hat{x}\|_{\mathbf{T}}^2. \qquad (9)$$

In the above, $\Lambda$ is the scaled lattice corresponding to the code, and $\mathbf{T}$ is a positive definite matrix. The above metric penalizes faraway elements of $\Lambda$ that are generally non-codewords. The second step includes lattice reduction using the LLL algorithm [11], and the last step is an efficient linear detection using, for example, the rounding-off algorithm.

Under standard assumptions on continuity, and in the presence of a policy $\mathcal{P}_{rT}$ that lets the decoder run its course irrespective of the complexity, the above decoder was shown in [1] to be DMT optimal, i.e., that

$$d_{\mathcal{X},\mathcal{D}_{\mathrm{LRR}},\mathcal{P}_{rT}}(r) = d_{\mathcal{X},\mathcal{D}_{\mathrm{ML}},\mathcal{P}_{rT}}(r),$$

irrespective of the lattice design $\mathcal{X}$, and irrespective of the fading statistics.

It is the case though that the decoder's LLL step introduces worst-case complexity that is infinite [12]. This problem is successfully addressed by the policy discussed below.

2) The LR-based policy $\mathcal{P}_{\mathrm{LR}}$: To limit the above infinite complexity, the work in [1] proposed a policy that capitalizes on the fact that, to achieve DMT optimality, it is not required to LLL-reduce every conceivable channel. Instead, in the event that too many flops occur, the policy instructs the implementation of the LLL algorithm to halt, and the decoder to declare an error. Special emphasis is given to guaranteeing that the event of halting is not more common than the event of error, thus avoiding degradation of the asymptotic error performance. Specifically the halting policy, to be denoted as $\mathcal{P}_{\mathrm{LR}}$, was defined on the basis of the bound on the number $K$ of LLL cycles that are necessary for reduction of the matrix $M$ which generates the composite code-channel lattice. This bound is given by [12], [13] to be

$$K \le n^2 \log_2 \kappa(M) + n, \qquad (10)$$

where $\kappa(M)$ denotes the 2-norm condition number of $M$. Based on this bound, $\mathcal{P}_{\mathrm{LR}}$ deploys the LLL algorithm only if

$$\kappa(M) < \ldots, \qquad \epsilon > 0, \qquad (11)$$



where $d_{\mathrm{ML}}(r) = d_{\mathcal{X},\mathcal{D}_{\mathrm{ML}},\mathcal{P}_{rT}}(r)$ describes the DMT achieved by the uninterrupted ML decoder. By showing that the event of halting is less common than the event of error under full ML decoding, it was proven in [1] that, over any range of multiplexing gains $r$, the combination of $\mathcal{D}_{\mathrm{LRR}}$ and $\mathcal{P}_{\mathrm{LR}}$ achieves DMT optimal decoding of any lattice design $\mathcal{X}_{\Lambda}$, and does so with worst-case complexity of $O(\log \rho)$. This implies a worst-case complexity that is at most linear in the rate² at high SNR. It also constitutes substantial improvement over sphere decoding implementations where the worst-case complexity reported (see for example [14] for fast decodable codes [15]-[17]) is also exponential in $R$, albeit with a smaller exponent than full search.
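The halting idea can be illustrated with a textbook LLL reduction that gives up once a cycle cap is reached. This is a minimal sketch under several assumptions: the cap is a plain count of loop iterations rather than the condition-number test of (11), the cap formula only mirrors the form of the bound (10) with an assumed logarithm base, the Gram-Schmidt data is recomputed naively for clarity, and the names `lll_with_halting` and `cycle_budget` are hypothetical.

```python
import math


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def gram_schmidt(B):
    """Return the Gram-Schmidt vectors and coefficients mu for the basis rows B."""
    n = len(B)
    Bs, mu = [], [[0.0] * n for _ in range(n)]
    for i in range(n):
        v = list(B[i])
        for j in range(i):
            mu[i][j] = dot(B[i], Bs[j]) / dot(Bs[j], Bs[j])
            v = [a - mu[i][j] * b for a, b in zip(v, Bs[j])]
        Bs.append(v)
    return Bs, mu


def cycle_budget(n, kappa_max, log_base=2.0):
    """Cycle cap of the form n^2 * log(kappa_max) + n (log base is an assumption)."""
    return int(n * n * math.log(kappa_max, log_base) + n)


def lll_with_halting(B, max_cycles, delta=0.75):
    """LLL-reduce the rows of B; return (None, cycles) if the cycle cap is hit."""
    B = [[float(a) for a in row] for row in B]
    n, k, cycles = len(B), 1, 0
    while k < n:
        if cycles >= max_cycles:
            return None, cycles  # policy halts: an error is declared
        cycles += 1
        # Size-reduce row k against rows k-1, ..., 0.
        for j in range(k - 1, -1, -1):
            _, mu = gram_schmidt(B)
            q = round(mu[k][j])
            if q:
                B[k] = [a - q * b for a, b in zip(B[k], B[j])]
        Bs, mu = gram_schmidt(B)
        # Lovasz condition: advance if satisfied, otherwise swap and step back.
        if dot(Bs[k], Bs[k]) >= (delta - mu[k][k - 1] ** 2) * dot(Bs[k - 1], Bs[k - 1]):
            k += 1
        else:
            B[k - 1], B[k] = B[k], B[k - 1]
            k = max(k - 1, 1)
    return B, cycles
```

A caller would treat a `None` basis as a declared decoding error, so the worst-case number of cycles never exceeds the cap regardless of how badly conditioned the input basis is.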

3) The overall worst-case complexity exponent jointly introduced by lattice encoding, $\mathcal{D}_{\mathrm{LRR}}$ and $\mathcal{P}_{\mathrm{LR}}$: With the above in mind, we proceed to establish the overall computational complexity jointly introduced by lattice encoding and by the different components of $\mathcal{D}_{\mathrm{LRR}}$, in the presence of $\mathcal{P}_{\mathrm{LR}}$.

a) Decoder and policy: We first quickly note that the 
regularization and linear-decoding steps, introduce complexity 
that is essentially independent of p, H, and bounded above by 
0{n^), thus inducing a zero complexity exponent. 

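Regarding the lattice reduction step, we recall the hard bound

$$K \le n^2 \log_2 \kappa(M) + n < n^2 \log_2 (\ldots) + n, \qquad \epsilon > 0,$$

on the number of LLL flops enforced by $\mathcal{P}_{\mathrm{LR}}$. This bound implies that

$$\exists z \in \mathbb{R}^{+} : P\left(N(H, x, w) > z \log \rho\right) = 0,$$

which in turn means that

$$P\left(N(H, x, w) > \rho^{\epsilon}\right) \doteq \rho^{-\infty}, \qquad \forall \epsilon > 0. \qquad (12)$$

In conjunction with the equivalent representation (drawing from [18], which presents some $c(r)$ of different $\mathcal{X},\mathcal{D},\mathcal{P}$)

$$c(r) = \sup\left\{c : -\lim_{\rho\to\infty} \frac{\log P\left(N(H, x, w) \ge \rho^{c}\right)}{\log \rho} < d(r)\right\}$$

of a worst-case complexity exponent $c(r)$ that allows for $d(r)$, we conclude that the LLL algorithm under $\mathcal{P}_{\mathrm{LR}}$ also introduces an effective complexity exponent equal to zero. Consequently the entire $\mathcal{D}_{\mathrm{LRR}}, \mathcal{P}_{\mathrm{LR}}$ introduces a minimal complexity exponent, equal to zero.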
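The step from a flop count that grows like log ρ to a zero complexity exponent can be checked numerically. In the sketch below the constant `z` and the helper names are assumptions for illustration; the point is only that log(z log ρ) / log ρ shrinks towards zero as ρ grows, whereas a full search over ρ^{rT} codewords keeps the exponent rT.

```python
import math


def complexity_exponent_estimate(flops, rho):
    """Finite-SNR value of log(flops) / log(rho)."""
    return math.log(flops) / math.log(rho)


def log_flop_cap(rho, z=50.0):
    """Hypothetical flop cap z * log(rho), with an arbitrary constant z."""
    return z * math.log(rho)


def full_search_flops(rho, r, T):
    """Flop count proportional to the codebook size rho^(rT)."""
    return rho ** (r * T)


# Exponent estimates of the logarithmic cap at increasingly high SNR.
estimates = [
    complexity_exponent_estimate(log_flop_cap(10.0 ** k), 10.0 ** k)
    for k in (2, 8, 32, 128)
]
full_search_exponent = complexity_exponent_estimate(
    full_search_flops(1e8, r=1.0, T=2), 1e8
)
```

The list `estimates` decreases monotonically towards zero, while `full_search_exponent` stays at rT = 2 for the chosen values.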

b) Lattice encoding: Moving on to encoding, it is again easy to see that any lattice code $\mathcal{X}_{\Lambda}$ comes with encoding
complexity that is bounded as O(n^), thus minimally adding 
to the overall complexity exponent of any transceiver/policy. 

We are now able to combine the complexities from the encoder and the decoder, and to provide the following.

Lemma 1: A lattice code $\mathcal{X}_{\Lambda}$, in conjunction with the decoder-policy $\mathcal{D}_{\mathrm{LRR}}, \mathcal{P}_{\mathrm{LR}}$, jointly accept a minimum, over all encoders, decoders and policies, effective complexity exponent, i.e.,

$$c_{\mathcal{X}_{\Lambda},\mathcal{D}_{\mathrm{LRR}},\mathcal{P}_{\mathrm{LR}}} = \inf_{\mathcal{X},\mathcal{D},\mathcal{P}} c_{\mathcal{XDP}} = 0. \qquad (13)$$

²The result is extended in [8] to the MIMO-MAC case, to show that this optimality holds with worst-case complexity that is at most linear in the users' sum-rate.

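4) The overall error performance: With respect to the error performance of $\mathcal{D}_{\mathrm{LRR}}, \mathcal{P}_{\mathrm{LR}}$, we utilize the result in [1] which proves that the DMT optimality of $\mathcal{D}_{\mathrm{LRR}}, \mathcal{P}_{\mathrm{LR}}$ holds irrespective of the lattice code that it is applied to, i.e., that for any fixed lattice code $\mathcal{X}_{\Lambda}$,

$$d_{\mathcal{X}_{\Lambda},\mathcal{D}_{\mathrm{LRR}},\mathcal{P}_{\mathrm{LR}}}(r) = \sup_{\mathcal{D},\mathcal{P}} d_{\mathcal{X}_{\Lambda},\mathcal{D},\mathcal{P}}(r). \qquad (14)$$

Disregarding for now issues on code design, we proceed to formalize the performance-complexity optimality of $\mathcal{D}_{\mathrm{LRR}}, \mathcal{P}_{\mathrm{LR}}$.

5) The overall effective complexity/error exponent jointly induced by lattice encoding, $\mathcal{D}_{\mathrm{LRR}}$ and $\mathcal{P}_{\mathrm{LR}}$: Combining (13) and (14) gives the following.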

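Lemma 2: The high-SNR rate-reliability-complexity tradeoff achieved by $\mathcal{D}_{\mathrm{LRR}}, \mathcal{P}_{\mathrm{LR}}$ is better than or asymptotically equal to the tradeoff achieved by any other decoder-policy, irrespective of the lattice code applied, i.e.,

$$\left(d_{\mathcal{X}_{\Lambda},\mathcal{D}_{\mathrm{LRR}},\mathcal{P}_{\mathrm{LR}}}(r),\ c_{\mathcal{X}_{\Lambda},\mathcal{D}_{\mathrm{LRR}},\mathcal{P}_{\mathrm{LR}}}(r)\right) = \left(\sup_{\mathcal{D},\mathcal{P}} d_{\mathcal{X}_{\Lambda},\mathcal{D},\mathcal{P}}(r),\ \inf_{\mathcal{D},\mathcal{P}} c_{\mathcal{X}_{\Lambda},\mathcal{D},\mathcal{P}}(r)\right).$$

Here it is stressed that this achievable tradeoff may be suboptimal, as it is limited by the reliability capabilities of the specific code $\mathcal{X}_{\Lambda}$.

What remains now is to combine the optimal components $\mathcal{D}_{\mathrm{LRR}}, \mathcal{P}_{\mathrm{LR}}$ with suitable code designs.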

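6) Employing DMT optimal codes, to meet the rate-reliability-complexity tradeoff: We have just seen in Lemma 2 that, given any lattice design $\mathcal{X}_{\Lambda}$, the combination $\mathcal{D}_{\mathrm{LRR}}, \mathcal{P}_{\mathrm{LR}}$ achieves the highest allowable tradeoff over any transceiver-policy that includes $\mathcal{X}_{\Lambda}$. Consequently what remains is to identify lattice code designs that optimize both $c_{\mathcal{X}_{\Lambda},\mathcal{D},\mathcal{P}}(r)$ and $d_{\mathcal{X}_{\Lambda},\mathcal{D},\mathcal{P}}(r)$, in the presence of $\mathcal{D}_{\mathrm{LRR}}, \mathcal{P}_{\mathrm{LR}}$. Optimizing of $c_{\mathcal{X}_{\Lambda},\mathcal{D},\mathcal{P}}(r)$ has already been achieved in Lemma 1, which proved that any lattice design $\mathcal{X}_{\Lambda}$ gives $c_{\mathcal{X}_{\Lambda},\mathcal{D}_{\mathrm{LRR}},\mathcal{P}_{\mathrm{LR}}} = \inf_{\mathcal{X},\mathcal{D},\mathcal{P}} c_{\mathcal{XDP}} = 0$. Hence what remains is to find a lattice design that optimizes $d_{\mathcal{X}_{\Lambda},\mathcal{D},\mathcal{P}}(r)$, in the presence of $\mathcal{D}_{\mathrm{LRR}}, \mathcal{P}_{\mathrm{LR}}$. This in turn is further simplified in the presence of (14), and the task is now limited to simply finding DMT optimal lattice codes, i.e., codes that asymptotically meet the outage region

$$\mathcal{O} = \left\{H : \frac{1}{2T} \log\det\left(I + \beta H H^{\top}\right) < R\right\}, \qquad \text{some fixed } \beta,$$

of the equivalent MIMO channel to achieve asymptotically optimal performance (cf. [2])

$$d_{\mathrm{opt}}(r) := \sup_{\mathcal{X},\mathcal{D},\mathcal{P}} d_{\mathcal{XDP}}(r), \qquad \rho^{-d_{\mathrm{opt}}(r)} \doteq P(H \in \mathcal{O}). \qquad (15)$$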
xpp 

The existence of such lattice codes has been proven in [19] for the quasi-static Rayleigh fading channel, and a unified family of such codes was explicitly constructed in [20] using cyclic division algebras (CDA). Further such codes have, over the last few years, been described for a plethora of MIMO models. These codes are based on different variants of CDA codes (cf. [21], [22]), and have been shown, under basic continuity conditions, to provide DMT optimality for all channel dimensions, and most often for all fading statistics. Such codes can, for example, be found in [20], [23]-[29], and they DMT-optimally apply to several MIMO scenarios, including MIMO, MIMO-OFDM, MIMO-MAC (Rayleigh fading), MIMO-ARQ, as well as to most existing cooperative communication protocols.
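As one concrete instance of an optimal DMT curve, the i.i.d. Rayleigh fading result of Zheng and Tse [2] gives a piecewise-linear curve through the points (k, (n_t - k)(n_r - k)) for integer k, valid for sufficiently long blocks. The sketch below evaluates that curve; the function name and argument names are chosen only for this example, and the general $d_{\mathrm{opt}}(r)$ of the text covers far broader channel models than this special case.

```python
def dmt_rayleigh(r, n_t, n_r):
    """Optimal diversity gain at multiplexing gain r for i.i.d. Rayleigh fading.

    Piecewise-linear interpolation of the points (k, (n_t - k) * (n_r - k)),
    k = 0, ..., min(n_t, n_r).
    """
    k_max = min(n_t, n_r)
    if not 0 <= r <= k_max:
        raise ValueError("r must lie in [0, min(n_t, n_r)]")
    # Index of the linear segment [k, k + 1] that contains r.
    k = min(int(r), k_max - 1)
    d_left = (n_t - k) * (n_r - k)
    d_right = (n_t - k - 1) * (n_r - k - 1)
    return d_left + (r - k) * (d_right - d_left)


# Corner points and one interior point for a 2 x 2 channel.
curve_2x2 = [dmt_rayleigh(r, 2, 2) for r in (0, 0.5, 1, 2)]
```

For the 2 x 2 channel the corner points are d(0) = 4, d(1) = 1 and d(2) = 0, with linear interpolation in between.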

For all the above MIMO scenarios, we have now the final 
result, which holds under basic continuity conditions. 

A. The optimal tradeoff 

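Theorem 1: The high-SNR optimal, over all encoders, decoders and policies, rate-reliability-complexity behavior is given by

$$\mathrm{opt}_{\mathcal{X},\mathcal{D},\mathcal{P}}\left(d_{\mathcal{XDP}}(r), c_{\mathcal{XDP}}(r)\right) = \left(d_{\mathcal{X}_{\mathrm{CDA}},\mathcal{D}_{\mathrm{LRR}},\mathcal{P}_{\mathrm{LR}}}(r),\ c_{\mathcal{X}_{\mathrm{CDA}},\mathcal{D}_{\mathrm{LRR}},\mathcal{P}_{\mathrm{LR}}}(r)\right) = \left(d_{\mathrm{opt}}(r), 0\right) \qquad (16)$$

and is achieved for all multiplexing gains, all channel dimensions and (in most known cases) all fading statistics, by the CDA-based designs $\mathcal{X}_{\mathrm{CDA}}$, the LR-aided regularized linear decoder $\mathcal{D}_{\mathrm{LRR}}$, and the LR-based policy $\mathcal{P}_{\mathrm{LR}}$.

Equivalently the result shows that the achievable rate-reliability-complexity combination

$$\left(\rho,\ R = r \log \rho,\ P = \rho^{-d_{\mathrm{opt}}(r)},\ C = \rho^{0}\right) \qquad (17)$$

is optimal, up to a factor that asymptotically becomes smaller than any $\rho^{\epsilon}$, for any $\epsilon > 0$. We quickly note that $\mathcal{X}_{\mathrm{CDA}}, \mathcal{D}_{\mathrm{LRR}}, \mathcal{P}_{\mathrm{LR}}$ is currently the only known tradeoff-optimal design.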

Directly from the above, we have the following. 

Corollary 1a: In the high-SNR regime, the union of all achievable SNR-rate-reliability-complexity combinations, considering all reasonable $\mathcal{X},\mathcal{D},\mathcal{P}$, is given by

$$0 \le d(r) \le d_{\mathrm{opt}}(r), \qquad 0 \le c(r) \le rT.$$

Proof: For a given $R$, any of the above reliability-complexity pairs can be achieved by employing an $\mathcal{X},\mathcal{D},\mathcal{P}$ that is optimal with respect to (16), modifying though $\mathcal{P}$ to introduce the appropriate amount of extra complexity and error.³ ■

Finally, using the fact that the complexity of the optimal 
transceiver is O(logp), it is easy to show that for several 
MIMO settings, optimal DMT performance is achieved with 
at most 0{n^) flops per bit. 

a) Optimal limits on general reliability-complexity functions: Another measure of the rate-reliability-complexity capabilities of different transceivers can take the form of general utility functions. Towards this we define the following.

Definition 1: Let $\Gamma$ be a weighting function that is increasing in $d_{\mathcal{XDP}}(r)$, decreasing in $c_{\mathcal{XDP}}(r)$, and which reflects the different costs assigned separately to erroneous detection and to complexity. Then we use

$$D_{\mathcal{XDP}}(r) := \Gamma\left(d_{\mathcal{XDP}}(r), c_{\mathcal{XDP}}(r)\right) \qquad (18)$$

to denote the $\Gamma$-general rate-reliability-complexity limit, for a given $\mathcal{X},\mathcal{D},\mathcal{P}$.

³Constructing such modification is trivial. We note that the worst case $(r, d(r) = 0, c(r) = rT)$ corresponds to a full-search transceiver that provides subexponential decay of the probability of error, for increasing SNR.

Towards motivating meaningful use of the limit, we identify 
the following simple manifestation as one of many special 
cases of the general limit. 

Definition 2: The homogeneous rate-reliability-complexity limit for a given $\mathcal{X},\mathcal{D},\mathcal{P}$, and a given weighting factor $\gamma > 0$, takes the form

$$D_{\mathcal{XDP}}(r) := d_{\mathcal{XDP}}(r) - \gamma\, c_{\mathcal{XDP}}(r), \qquad (19)$$

and describes the diversity gain minus the normalized complexity cost.

It is interesting to interpret the rate-reliability-complexity limit $D_{\mathcal{XDP}}(r)$ as a limit that describes the high-SNR error capabilities of $\mathcal{X},\mathcal{D},\mathcal{P}$, per unit power and complexity. Equivalently, the limit may be described as a measure of diversity gain per complexity order.
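The homogeneous limit lends itself to a small numerical comparison between transceivers. The sketch below is an illustration only: the values of r, T, the weighting factor and the common diversity gain are hypothetical, and the two exponent pairs simply reflect the extremes discussed in the text, namely a full-search decoder with complexity exponent rT and an LR-aided design with complexity exponent zero.

```python
def homogeneous_limit(d, c, gamma):
    """Diversity gain minus the weighted complexity exponent, d - gamma * c."""
    return d - gamma * c


# Hypothetical operating point and weighting factor.
r, T, gamma = 1.0, 2, 0.5
d_common = 1.0  # assumed diversity gain shared by both designs

# (diversity gain, complexity exponent) for each candidate design.
candidates = {
    "full_search_ml": (d_common, r * T),
    "lr_aided_with_halting": (d_common, 0.0),
}
scores = {
    name: homogeneous_limit(d, c, gamma) for name, (d, c) in candidates.items()
}
best_design = max(scores, key=scores.get)
```

With equal diversity gains, the design with the smaller complexity exponent attains the larger limit for any positive weighting factor.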

The following result, which holds under basic continuity conditions, for the same scenarios as Theorem 1, describes the optimizing value achieved by a large family of measures $\Gamma$.

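Corollary 1b: The optimal, over all encoders, decoders and policies, $\Gamma$-general rate-reliability-complexity limit $D(r)$ is given by

$$D(r) = \Gamma\left(\sup_{\mathcal{X},\mathcal{D},\mathcal{P}} d_{\mathcal{XDP}}(r), 0\right) = \Gamma\left(d_{\mathrm{opt}}(r), 0\right) \qquad (20)$$

and is achieved for all multiplexing gains and all channel dimensions by the CDA-based designs $\mathcal{X}_{\mathrm{CDA}}$, the LR-aided regularized linear decoder $\mathcal{D}_{\mathrm{LRR}}$, and the LR-based policy $\mathcal{P}_{\mathrm{LR}}$.

Proof: The proof is direct by noting that

$$D(r) = \sup_{\mathcal{X},\mathcal{D},\mathcal{P}} \Gamma\left(d_{\mathcal{XDP}}(r), c_{\mathcal{XDP}}(r)\right) \qquad (21)$$

$$\le \Gamma\left(\sup_{\mathcal{X},\mathcal{D},\mathcal{P}} d_{\mathcal{XDP}}(r),\ \inf_{\mathcal{X},\mathcal{D},\mathcal{P}} c_{\mathcal{XDP}}(r)\right), \qquad (22)$$

and then by applying Theorem 1. ■

The following holds for the more intuitive, cost-symmetric version of the limit.

Corollary 1c: The optimal, over all encoders, decoders and policies, homogeneous rate-reliability-complexity limit is given by

$$D(r) = \sup_{\mathcal{X},\mathcal{D},\mathcal{P}} \left(d_{\mathcal{XDP}}(r) - \gamma\, c_{\mathcal{XDP}}(r)\right) = d_{\mathrm{opt}}(r). \qquad (23)$$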
xpp 

V. Conclusions 

The tradeoff and its achievability, provide worst-case guar- 
antees on the complexity required for provably optimal perfor- 
mance in outage-limited MIMO communications. The guaran- 
tees hold over a surprisingly broad setting, and they come with 
reduced transmission energy and delay, as well as reduced 
algorithmic power consumption and hardware. The tradeoff 
concisely quantifies these guarantees and the capabilities of 
different transceivers, as well as quantifies the role of policies 
in simplifying algorithms which would otherwise introduce 
unbounded complexity. 